Tree-Based Policy Learning in Continuous Domains through Teaching by Demonstration
نویسندگان
چکیده
This paper addresses the problem of reinforcement learning in continuous domains through teaching by demonstration. Our approach is based on the Continuous U-Tree algorithm, which generates a tree-based discretization of a continuous state space while applying general reinforcement learning techniques. We introduce a method for generating a preliminary state discretization and policy from expert demonstration in the form of a decision tree. This discretization is used to bootstrap the Continuous U-Tree algorithm and guide the autonomous learning process. In our experiments, we show how a small number of demonstration trials provided by an expert can significantly reduce the number of trials required to learn an optimal policy, resulting in a significant improvement in both learning efficiency and state space size.
منابع مشابه
Confidence-Based Robot Policy Learning from Demonstration
The problem of learning a policy, a task representation mapping from world states to actions, lies at the heart of many robotic applications. One approach to acquiring a task policy is learning from demonstration, an interactive technique in which a robot learns a policy based on example state to action mappings provided by a human teacher. This thesis introduces Confidence-Based Autonomy, a mi...
متن کاملThe Effect of Teaching through Demonstration on Midwifery Student's Self-efficacy in Delivery Management
Introduction: Active learning methods are becoming increasingly popular in midwifery students education. So the aim of this study was to determine the effect of education using demonstration on midwifery student's self-efficacy in delivery managementMethods: This quasi-experimental study was performed in 2013 in Isfahan University of Medical Sciences. Thirty midwifery students were selected thr...
متن کاملComparison of Video-Based Instruction and Instructor Demonstration on Learning of Practical Skills in Nursing Students
Introduction: Since technology has an important role in the improvement of educational quality, finding better methods of teaching and learning and improving equipment and teaching materials is emphasized. Regarding this, two educational methods- presentation by the instructor and video presentation, were offered and their effectiveness on nursing students’ learning skills was compared. Method...
متن کاملConstructing Skill Trees for Reinforcement Learning Agents from Demonstration Trajectories
We introduce CST, an algorithm for constructing skill trees from demonstration trajectories in continuous reinforcement learning domains. CST uses a changepoint detection method to segment each trajectory into a skill chain by detecting a change of appropriate abstraction, or that a segment is too complex to model as a single skill. The skill chains from each trajectory are then merged to form ...
متن کاملConfidence-Based Multi-Robot Learning from Demonstration
Learning from demonstration algorithms enable a robot to learn a new policy based on demonstrations provided by a teacher. In this article, we explore a novel research direction, multi-robot learning from demonstration, which extends demonstration based learning methods to collaborative multi-robot domains. Specifically, we study the problem of enabling a single person to teach individual polic...
متن کامل